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Abstract 

Background: The tunnor suppressor Rb/E2F regulates gene expression to control differentiation in nnultiple tissues 
during developnnent, although how it directs tissue-specific gene regulation in vivo is poorly understood. 

Results: We determined the genome-wide binding profiles for Coenorhobditis elegons Rb/E2F-like components in 
the germline, in the intestine and broadly throughout the soma, and uncovered highly tissue-specific binding 
patterns and target genes. Chromatin association by LIN-35, the C elegons ortholog of Rb, is impaired in the 
germline but robust in the soma, a characteristic that might govern differential effects on gene expression in the 
two cell types. In the intestine, LIN-35 and the heterochromatin protein HPL-2, the ortholog of Hpl, coordinately 
bind at many sites lacking E2F. Finally, selected direct target genes contribute to the soma-to-germline 
transformation of lin-35 mutants, including mes-4, a soma-specific target that promotes H3K36 methylation, and 
csr-1, a germline-specific target that functions in a 22G small RNA pathway. 

Conclusions: In sum, identification of tissue-specific binding profiles and effector target genes reveals important 
insights into the mechanisms by which Rb/E2F controls distinct cell fates in vivo. 



Background 

The Rb/E2F transcriptional complex is a major regulator 
of developmental and cellular fates. Underscoring its 
importance, the pocket protein Rb acts as a key tumor 
suppressor protein in cancers of diverse tissue origin 
(reviewed in [1]). Rb acts in large part by regulating the 
activity of E2F, a heterodimeric sequence-specific DNA 
binding factor composed of an E2F and DP subunit In 
mammals, these factors are members of gene families: 
there are at least eight E2F-related factors, three DP- 
related factors, and three pocket proteins. These family 
members exhibit considerable redundancy and compen- 
sation. Moreover, a particular family member can either 
promote or inhibit tumorigenesis in a cell type-depen- 
dent manner (reviewed in [2]). This complexity has 
greatly hampered a mechanistic understanding of how 
the Rb/E2F pathway acts in vivo. To date, the only gen- 
ome-wide chromatin immunoprecipitation (ChIP) 
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analyses of mammalian Rb/E2F have been performed in 
tissue culture, often in transformed cell lines (for exam- 
ple, [3,4]). While valuable, the resulting global DNA 
binding profiles of Rb and E2F can be correlated only 
indirectly with tissue-specific phenotypes and ultimately 
with tumorigenesis. 

The nematode Caenorhabditis elegans provides an 
excellent system in which to directly investigate the 
function of Rb/E2F in vivo. Relative to mammals, its Rb/ 
E2F pathway is very streamUned, with only one Rb-like 
pocket protein (LIN-35), one DP-like protein (DPL-1), 
and three E2F-related proteins, of which EFL-1 exerts 
the broadest effects in the animal [5,6]. As in mammals, 
these factors are broadly expressed and play diverse 
roles in different tissues. They are part of a gene regulatory 
pathway known as SynMuv B that mediates differentiation 
of various somatic tissues, including the vulva, intestine, 
and pharynx (reviewed in [7]). A recent report used 
genetic, biochemical and gene expression data to place 
members of the SynMuv B pathway into three functionally 
distinct complexes', the DRM, heterochromatin, and Mec/ 
Sumo complexes [8]. These three complexes contribute 
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differentially to various SynMuv B phenotypes, potentially 
by selectively regulating subsets of target genes. LIN-35, 
EFL-1 and DPL-1 are members of the DRM complex. 

A major function of the SynMuv B pathway is to pre- 
vent somatic tissues from adopting characteristics of the 
germline fate, such as ectopic expression of germline 
genes, enhanced response to RNA interference (RNAi), 
and increased transgene silencing [9]. This soma-to- 
germline transformation is also associated with dis- 
rupted intestinal function and larval arrest at high tem- 
peratures [10]. Intriguingly, certain members of both 
DRM and heterochromatin complexes, such as lin-35 
and hpl-2, respectively, are required for the high tem- 
perature arrest, while other members of the two com- 
plexes, such as efl-1 and lin-61, are not [10], suggesting 
tissue-specific formation of the SynMuv B complexes 

Additionally, components of the SynMuv B pathway 
act differently in the germline compared to somatic tis- 
sues. For instance, in lin-35 mutants, germ cells exhibit 
impaired proliferation but can still undergo gametogen- 
esis and fertilization; as a consequence mutants are fer- 
tile but have decreased brood size. By contrast, efl-1 and 
dpl-1 mutants display severe defects in oogenesis, ovula- 
tion, and fertilization, and are sterile [5,11]. All of these 
data indicate that LIN-35, EFL-1 and DPL-1 control a 
fundamental developmental choice between 'immortal' 
germline and differentiated soma. However, an under- 
standing of the tissue-specific relationships between 
these proteins and their targets remains unclear. In par- 
ticular, it is essential to determine whether these pro- 
teins directly influence many target genes or a few 
master regulators to determine these fates. To under- 
stand mechanistically how LIN-35, EFL-1 and DPL-1 
mediate their diverse effects in different contexts, we 
have chosen to identify the target sites for these factors 
in multiple tissue types. 

Genome-wide analysis of transcription factor binding 
in C. elegans has so far only been carried out using 
whole animals as the source material [12-14]. In particu- 
lar, one recent study identified binding sites for another 
DRM component, LIN-54, in whole animals with all 
developmental stages combined [15]. While providing 
insight into the organismal function of the SynMuv 
pathway, this and other studies to date have masked cell 
type-specific binding events for broadly expressed fac- 
tors with diverse functions, such as Rb/E2F. 

To address this limitation, we selectively expressed epi- 
tope-tagged LIN-35, EFL-1, and DPL-1 in the germUne, 
intestine, and throughout the soma. We also expressed 
the SynMuvB heterochromatin complex protein HPL-2 
(HPl-like) in the intestine. With these strains, we pro- 
filed chromatin interactions genome-wide and identified 
binding sites for each factor in each tissue that define 
sets of tissue-specific target genes with distinct properties 



and functions. Strikingly, most EFL-l/DPL-1 binding 
sites in the germline exhibit little to no LIN-35 binding, 
and LIN-35 binding is impaired overall in the germline 
relative to the soma. Conversely, in the intestine, LIN-35 
binding is robust, and a subset of sites co-bound by HPL- 
2 but not EFL-l/DPL-1 exhibit unique properties. Our 
data suggest that LIN-35/EFL-l/DPL-l most likely inhi- 
bits the germline fate in somatic tissues by directly acting 
on a few key targets rather than on many hundreds of 
individual genes. In sum, these tissue-specific binding 
profiles lead to insights into tissue-specific properties of 
Rb/E2F function. 

Results 

Tissue-specific binding profiles for LIN-35, DPL-1, EFL-1 
and HPL-2 

We generated a series of tissue-specific transgenes con- 
taining the efl-1, dpl-1 or lin-35 genomic locus with a 
GFP:FLAG epitope tag inserted in frame at the carboxyl 
terminus of each gene, followed by the native 3' UTR 
(Figure SIA in Additional file 1). Different regulatory 
sequences were used to restrict transgene expression in 
the germline (pie-1 regulatory sequences), intestine (ges-1 
regulatory sequences), or broadly throughout diverse cell 
types (endogenous lin-35, efl-1, or dpl-1 regulatory 
sequences). We also tagged hpl-2 and expressed it in the 
intestine, where it has a demonstrated genetic interaction 
with lin-35 and plays a role in the high temperature larval 
arrest phenotype [10,16]. 

We produced integrated transgenic strains expressing 
each GFP-tagged protein, all of which localized to nuclei 
in the expected tissue(s) but not elsewhere (Figure SIB 
in Additional file 1). Several transgenes were tested for 
rescue of mutant phenotypes (Figure S2A-E in Addi- 
tional file 1; Supplemental Materials and methods in 
Additional file 1). For example, endogenous LIN-35, 
which is expressed in both the soma and the germline, 
rescued the somatic lin-35 mutant phenotype of high- 
temperature larval arrest [10], as well as the germline 
phenotype of reduced fertility [11]. By contrast, germline 
LIN-35 rescued the reduced fertility, but not the somatic 
larval arrest, demonstrating tissue-specific function. In 
sum, the rescue experiments are consistent with tissue- 
specific activity of the transgenic proteins. 

We then selected an appropriate developmental stage to 
characterize DNA binding for each factor in each tissue. 
The young adult stage is optimal for germline-expressed 
factors because animals at this stage have a fully developed 
germline with ongoing oogenesis but few embryos. We 
selected the larval LI stage to analyze both endogenous- 
expressed and intestine-expressed factors, because animals 
at this stage have primarily somatic tissues (only two 
quiescent germ cells are present) [17,18], and the soma- 
to-germline transformation is best characterized at the LI 



Kudron et al. Genome Biology 2013, 14:R5 
http://genomebiology.com/201 3/1 4/1 /R5 



Page 3 of 1 7 



stage [10,16]. Thus, the data sets corresponding to the 
endogenous-expressed and intestine-expressed factors will 
be referred to as somatic' and 'intestinal', respectively. 

Biological replicates of synchronized populations of 
each strain at the selected stage were subjected to ChIP 
using an anti-GFP antibody, followed by lUumina deep 
sequencing [12,14]. Several example binding profiles are 
shown in Figure la. Reproducibility between biological 
replicates was > 90%, except for germline LIN-35 (73%) 
and intestinal HPL-2 (65%) (Figure S3A in Additional 
file 1), which showed distinct binding profiles from the 
other factors. As an additional control, we performed 
Chip on wild-type LI animals using an antibody to the 
endogenous EFL-1 protein. The binding profile of endo- 
genous EFL-1 was remarkably similar to that of somatic 
EFL-1 (98% overlap; Figure S3A, B in Additional file 1). 
This comparison demonstrates that transgenic expres- 
sion does not result in extensive ectopic binding, vali- 
dating our approach. 

We identified genome-wide binding sites for each fac- 
tor in each tissue using PeakSeq (q < 0.001) [19]. The 
number of sites ranged from as few as 688 (germline 
LIN-35) to as many as 4,055 (somatic LIN-35) (Addi- 
tional file 2). Consistent with the expectation that LIN- 
35, EFL-1, and DPL-1 act in a complex, binding sites of 
these factors exhibit extensive overlap in each tissue 
(Figure S4 in Additional file 1; Additional file 2), greater 
than for an unrelated transcription factor such as ALR-1 
(data not shown) [14]. 

Tissue-specific gene targets have distinct properties and 
functions 

Each factor clearly exhibited tissue-specific binding 
events (Figure S4 in Additional file 1). We formally 
defined mutually exclusive sets of tissue-specific binding 
sites using criteria based on known or expected func- 
tions of, and relationships between, factors in each tis- 
sue as briefly outlined below (see Supplemental 
Materials and methods in Additional file 1 for additional 
rationale for criteria). 

Germline-specific sites (415) were bound by both 
germline EFL-1 and germline DPL-1 but not somatic 
DPL-1. We did not require binding by germline LIN-35 
because it displays weak binding in the germline (see 
below). Somatic DPL-1 had very strong binding and was 
used to exclude binding sites not specific to the 
germline. 

Soma-specific sites (282) were bound by somatic EFL-1, 
somatic DPL-1 and somatic LIN-35, but not germline 
EFL-1 or intestinal HPL-2. Somatic LIN-35, DPL-1, and 
EFL-1 showed very coordinated binding; thus, we included 
binding by all three factors. Exclusion of germline EFL-1 
sites eliminates those also bound in the germline, while 
exclusion of intestinal HPL-2 sites removed many known 



to be non-specific 'HOT' sites [13]. These sites could be 
occupied in one or more somatic cell types. 

Intestine-specific sites (656) were bound by both 
intestinal LIN-35 and intestinal HPL-2 but not somatic 
EFL-1 or intestinal EFL-1. This class was defined by our 
observation of binding sites with unique characteristics 
in the intestine that lacked EFL-1 binding but had 
highly coordinated LIN-35 and HPL-2 binding. 

Broadly bound sites (1,419) were bound by germline 
EFL-1, intestinal DPL-1, and somatic LIN-35. These 
three factors exhibited the strongest binding of any fac- 
tor in each tissue and were therefore selected to identify 
binding sites expressed in both the germline and at least 
one somatic tissue (the intestine), and possibly other 
somatic tissues as well. 

Examples of each category are displayed in Figure la. We 
independently validated a subset of germline-specific and 
soma-specific sites by ChlP-quantitative PCR (qPCR), con- 
firming tissue-specific binding for 11 of 12 (Figure S5 in 
Additional file 1). Binding sites in each category were then 
associated with candidate target genes whose transcript 
start sites were either less than 500 bp from the binding 
site (high confidence targets), or between 500 and 2,000 bp 
from the binding site (low confidence targets). Sites more 
than 2,000 bp from the start site of any known gene were 
left unassigned (Additional file 3) [14]. Most were assigned 
with high confidence to one or more coding genes; how- 
ever, the intestine-specific dataset exhibits a relatively high 
fraction of unassigned binding sites (Figure lb). 

To assess the sensitivity of this tissue-specific approach, 
we compared the tissue-specific datasets with those identi- 
fied in whole animal, mixed stage ChlP-chip experiments 
for LIN-54, another SynMuvB component that is expected 
to share many binding sites with Rb/E2F components as 
part of the DRM complex [15]. Strikingly, only 9% of the 
intestine-specific and 11% of the germline-specific targets 
were identified in the LIN-54 study, compared to 60% and 
41% of the broadly bound and soma-specific targets, 
respectively (Additional file 4). The tissue-specific profiles 
identif)^ hundreds of new binding sites, in addition to per- 
mitting the assignment of many binding events to a parti- 
cular cell type. 

LIN-35, EFL-1 and DPL-1 are expected to regulate genes 
with germline expression in both the germline and soma 
[11,20]. We used published germline expression data [21] 
to assess the fraction of germline-expressed genes for each 
set of targets, and found that germline expression is over- 
represented among the germline-specific, soma-specific, 
and broadly bound targets, but not the intestine-specific 
targets (Figure Ic). Enrichment for binding at germline- 
expressed genes in the soma-specific and broadly bound 
datasets is consistent with the ability of lin-35 to repress 
the germline fate in somatic tissues [9]. Additionally, the 
germline-specific, soma-specific, and broadly bound 
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Figure 1 Genes with tissue-specific binding by LIN-35, EFL-1 and DPL-1 exhibit unique properties, (a) Single-gene examples of the typical 
binding profile for each tissue-specific dataset. The key for each factor and tissue is to the left of the tracks. One track is shown for each factor in sets 
corresponding to each tissue, with a control (input) sample for each tissue (black). Red, germline-specific promoter; blue, endogenous promoter; 
orange, intestine-specific promoter, (b) Graph showing the fraction of binding sites for each tissue-specific dataset not readily assignable to at least 
one nearby coding gene, (c) The fraction of candidate gene targets with germline-intrinsic or oogenesis-enriched expression based on [21] (about 
0.105 of genes in the genome, marked by black line). Bars marked with an asterisk have significant over-representation {P < 1.8 x e-15 or lower; 
hypergeometric probability test), (d) The chromosomal distribution of candidate gene targets for each tissue-specific dataset. Statistically significant 
deviations from the expected value of 1 (marked with a black line) are indicated by an asterisk {P < 1.0e"°^ Pearson's chi-square). (e) Gene Ontology 
analysis of candidate target genes in each tissue, with Gene Ontology category 'molecular process', and the extent of enrichment indicated by the bar. 
Up to ten categories, all with more than two-fold enrichment and a P-value < 0.05, are shown. Redundant categories were removed manually for each 
tissue. Full analysis available in Additional file 5. 
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candidate target genes are strikingly under-represented on 
the X chromosome, whereas intestine-specific targets are 
substantially enriched on the X chromosome (Figure Id). 
This observation is consistent with the fact that relatively 
few germline-expressed genes are located on the X, and 
the X chromosome is poorly expressed in most germ cells 
[22,23]. Thus, LIN-35, EFL-1, and DPL-1 primarily bind 
near germline-expressed genes, with the exception of the 
intestine-specific sites. 

Despite having the common characteristic of germline 
expression, the genes in the tissue-specific datasets have 
strikingly different predicted functions, based on Gene 
Ontology (GO) categories (Figure le; Additional file 5). 
The germline-specific candidate targets include many 
whose functions have been implicated in oogenesis, fer- 
tilization, and embryonic patterning, consistent with the 
known germline phenotypes of efl-1 and dpl-1 mutants 
[11,24]. For example, multiple genes in this dataset med- 
iate chitin and chondroitin biosynthesis and have been 
implicated in eggshell formation, including cpg-2, cpg-3, 
cpg-4, gna-2, cbd-1, chs-1, and four C-type lectin genes. 
Soma-specific candidate targets function in fundamental 
cellular processes that occur in somatic tissues as well 
as in the germline, such as splicing, translation, and pro- 
teolysis. The broadly bound candidate gene targets tend 
to function in cell cycle-related processes such as mito- 
sis and replication, similar to the best-studied Rb/E2F 
targets in mammalian systems [3]. Finally, the intestine- 
specific set is enriched for genes involved in cellular 
metabolism, such as fatty acid biosynthesis and glucose 
metabolism, as well as the unfolded protein response in 
the endoplasmic reticulum. Cumulatively, these results 
demonstrate that the sets of tissue-specific binding sites 
correspond to target genes with fundamentally distinct 
properties. 

LIN-35 exhibits reduced binding in the germline relative 
to somatic tissues 

A notable feature of our data was marked reduction of 
DNA binding by germline LIN-35 compared to somatic 
tissues (Figure 2a). The relatively few germline LIN-35 
binding sites are also bound by germline EFL-1 and/or 
DPL-1, but are much weaker (median q-value = 6.3e'^^ 
for LIN-35, compared to 6.4e'^^ for DPL-1 and 5.2e'^^ 
for EFL-1; Figure S4 in Additional file 1). Poor binding 
is likely not a technical problem, because germline LIN- 
35 rescued the lin-SS mutant phenotype in the germline 
(Figure S2 in Additional file 1). Moreover, the epitope 
tag does not impair LIN-35 binding, as both somatic 
and intestine LIN-35:GFP bound chromatin proficiently. 
However, to ensure that the weak binding of germline 
LIN-35 did not reflect an undetected problem with this 
particular transgenic line, we utilized the promoter of 
mex-S to drive LIN-35:GFP expression in the germline; 



this line also rescued lin-35 mutant germline defects 
(Figures SI and S2 in Additional file 1). ChlP-seq analy- 
sis demonstrated that me;\:-5-driven LIN-35:GFP exhibits 
the same limited binding capability as pie- 1 -driven LIN- 
35:GFP (Figure 2a). Moreover, Pmex-5:UN-35:GF? 
binding sites overlap extensively with binding by germ- 
line EFL-1 (Figure 2b), similar to ?pie-l:UN-35:GT? 
(Figure S3A in Additional file 1). This result confirms 
that the binding capability of LIN-35 is impaired specifi- 
cally in the germline. 

Reduced LIN-35 binding in the germline might occur 
because LIN-35 has a significantly reduced role in com- 
plexes in which EFL-1 and DPL-1 activate gene expres- 
sion, consistent with the canonical model for Rb/E2F 
function in which the dissociation of a pocket protein 
switches E2F from repressor to activator [25]. Alterna- 
tively, LIN-35 binding might be restricted to a subset of 
germ cells. We speculated that LIN-35 might act specifi- 
cally in the mitotic progenitor cells of the germline to 
prevent premature activation of EFL-l/DPL-l-regulated 
genes, which are poorly expressed in progenitor cells 
and strongly upregulated by EFL-l/DPL-1 as germ cells 
initiate meiosis and gametogenesis [11]. Consistent with 
this possibility, the EFL-l/DPL-1 target gene lip-1 exhi- 
bits increased mRNA and protein levels in the progeni- 
tor cells of lin-SS mutants [26]. 

To distinguish between these two possibilities, we per- 
formed in situ hybridization of wild-type and lin-SS 
mutant gonads to examine the spatial expression pattern 
of four target genes {par-S, egg-1, rme-2, and chs-1) 
bound in the germline by LIN-35, EFL-1 and DPL-1 
(Figure 2c; data not shown). However, none exhibited 
expanded expression into the progenitor cell population 
of lin-SS mutants. Instead, overall expression appeared 
mildly reduced in the proximal gonad in the mutant 
compared to wild type. This result is consistent with 
LIN-35 having a minimal role in complexes in which 
EFL-l/DPL-1 is functioning as an activator, rather than 
acting specifically in progenitor germ cells. By contrast, 
in somatic tissues LIN-35 binding is extensive and the 
complex primarily inhibits gene expression. Thus, tis- 
sue-specific regulation of the association of LIN-35 with 
EFL-l/DPL-1 might be a key factor determining whether 
the complex activates or represses gene expression. 

Tissue-specific target genes are differentially regulated in 
//n-35, efl-1 and dpl-1 mutants 

To investigate how tissue-specific target genes are regu- 
lated by EFL-1, DPL-1, and LIN-35, we compared the 
germUne-specific, soma-specific, intestine-specific, and 
broadly bound targets to data from three published micro- 
array studies on efl-1, dpl-1, and/or lin-SS mutants, in the 
same stages and/or tissues (Figure 3a; Additional file 6). 
One study analyzed gene expression in dissected gonads. 
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Figure 2 LIN-35 exhibits reduced binding in the germline. (a) Example of binding across tine entirety of diromosome I for eacli factor in tine 
germline (indicated by tine promoter, either Ppie-1, red, or Pmex-5, green), and for LIN-35 in the soma {Plin-35, blue) and the intestine {Pges-1, 
orange). Three individual genes are shown below, two of which are germline-specific and show little (left) or no (middle) LIN-35 binding, and 
one that is broadly bound and shows LIN-35 binding in somatic tissues and not in the germline. (b) Venn diagram comparing overlap of 
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identifying 74 genes with down-regulated expression in 
both efl-1 and dpl-1 mutant gonads and 88 genes with up- 
regulated expression in lin-35 mutant gonads [11]. Two 
other studies identified up- and down-regulated genes in 
the soma of lin-SS mutant LI larvae, either at 20°C [20], 
or at 26°C [10]. These latter two studies have significant 
overlap (55% of up-regulated genes from the smaller 20°C 
gene list; P < 8.0e'^^^, hyper geometric probability test). 

Comparison of the germline-specific target genes with 
the genes regulated in dissected gonads of efl-1, dpl-1 and 
lin-35 mutants showed that 36 of the 74 down- regulated 
genes were bound by germline EFL-1 and DPL-1 {P < 
1.03e'^^), consistent with EFL-1 and DPL-1 acting directly 
to promote gene expression in the germline. Only two 
germline-specific target genes {unc-101 and hsf-2) were 
differentially expressed in lin-35 mutant gonads, indicating 
that LIN-35 is not required for the correct expression 
levels of most targets, and consistent with the limited 
binding by LIN-35 in the germline. The broadly bound 
candidate gene targets were over-represented among 
somatic LI lin-35 up-regulated genes (20°C, P < 2.8 le'^^; 
and 26°C, P < 1.82e'^^), consistent with a subset of these 
genes being bound and down-regulated by LIN-35 in var- 
ious somatic tissues. Surprisingly, the soma-specific and 
intestine-specific targets did not overlap significantly with 
LI lin-35 regulated genes at 20°C or 26°C. 

Indeed, the majority of tissue-specific candidate gene 
targets from any group were not significantly regulated in 
the microarray analyses. Transcription factor binding and 
target gene regulation typically show a poor correlation, 
which could be due partly to incorrect target assignment 
or shortcomings with the microarray analysis (reviewed in 
[27]). We therefore tested whether 'unregulated' direct 
gene targets were in fact regulated by EFL-1 /DPL-1 and/ 
or LIN-35, using qRT-PCR for several candidates in each 
category. Of five germline-specific targets tested, all five 
had decreased expression in dpl-1 mutant gonads relative 
to controls (Figure 3b), suggesting that most germline- 
specific candidate target genes require DPL-1, and pre- 
sumably EFL-1 as well, for expression in the gonad and 
that they were missed in the microarray analysis. Addi- 
tionally, we examined expression of targets in the soma- 
specific, intestine-specific, and broadly bound sets in lin- 
35 mutant LI larvae raised at 26°C. Three broadly bound 
and two intestine-specific genes showed > 1.5-fold 
increased expression in lin-35 mutants relative to wild type 
(Figure 3c), but the rest exhibited little to no change in 
expression. This result suggests that LIN-35/EFL-1 /DPL-1 
inhibits expression of only a subset of the candidate gene 
targets in these categories. We conclude that a binding 
event is much more likely to directly affect expression levels 
of candidate target genes in the germline than in the soma. 

Finally, a recent study monitored the expression of 
several candidate SynMuvB target genes in the soma of 



young adults lacking a germline, in which various DRM 
and heterochromatin SynMuvB complexes were inacti- 
vated by mutation or RNAi [8]. This analysis defined 
four DRM-specific targets {spn-4, mut-2, rde-4, and drh- 
3), two heterochromatin-complex-specific targets {wago- 
1 and wago-10), and seven 'common' targets regulated 
by both complexes. We therefore examined whether 
these genes were tissue-specific direct targets of EFL-1, 
DPL-1, and LIN-35. wago-1 and wago-10 are germline- 
specific targets, while spn-4 and mut-2 are broadly 
bound, drh-3 is not bound, and rde-4 exhibits a complex 
binding pattern in somatic tissues that was not classi- 
fied. Two of the common targets {pgl-3 and wago-9) are 
germline-specific and five were not significantly bound. 
Thus, we did not find a strict correlation between bind- 
ing profile and regulation by the different SynMuvB 
complexes, although notably both heterochromatin 
complex-specific genes were in the germline-specific 
category, while none of the DRM-specific genes were. 

mes-4 is a direct target of LIN-35/EFL-1 /DPL-1 in the soma 

The tissue-specific binding profiles permit identification of 
key targets that might contribute to the adoption of germ- 
line-characteristics in the somatic tissues of lin-35 mutants. 
As described above, we found little overlap between lin-35 
mis- regulated genes and those bound by LIN-35, EFL-1, or 
DPL-1 in the soma (Figure 3 A in Additional file 6), sug- 
gesting that LIN-35-mediated repression of germline genes 
in the soma at this stage is largely indirect. 

Strikingly, one of the few genes both bound by LIN-35 
specifically in the soma, and differentially regulated in 
lin-35 mutants, is mes-4 (Figure 4a). We confirmed mes- 
4 transcript induction in lin-35 mutant Lis relative to 
wild type by qRT-PCR (Figure 4b), consistent with pre- 
vious microarray experiments [20]. Moreover, in lin-35 
mutants, EFL-1 binding at the mes-4 promoter is vastly 
reduced, whereas most other direct target genes still 
retain EFL-1 binding, suggesting that regulation of mes- 
4 is specifically disrupted in lin-35 mutants (Figure 4c; 
Figure S6A in Additional file 1). 

mes-4 is especially intriguing as a direct target because 
its activity is essential for the germline-to-soma transfor- 
mation: mes-4; lin-35 mutants exhibit a significantly 
reduced larval arrest and reduced levels of somatic germ 
granules relative to lin-35 mutants [9,10]. It also has the 
ability to suppress multiple other lin-35 related pheno- 
types [28]. MES-4 encodes an H3K36 histone methyl- 
transferase that acts primarily in the bodies of genes 
expressed in the germline, presumably so that they can 
be re-expressed in germ cells in the next generation 
[29]. These observations have led to the working model 
that LIN-35 antagonizes the pro-germline influence of 
MES-4. Our data suggest that LIN-35 does so, at least 
in part, by binding directly to the mes-4 gene in the 
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Figure 3 Tissue-specific target genes in //n-35, efl-1, and dpl-l mutants are differentially regulated, (a) Over-representation of tissue 
specific candidate gene targets in different publislied microarray expression datasets. Tine expression datasets on tine x-axis labeled E2F gonad 
and LIN gonad are from [1 1], while the 20°C data sets are from [20], and the 26°C data sets are from [10]. Statistically significant deviations are 
indicated by asterisks {P < l.Oe-05, hypergeometric probability test), (b) qRT-PCR results for selected germline-specific candidate target genes 
that were not considered regulated in the gonad microarray datasets. Fold difference of expression was compared between control {unc-4) and 
mutant {unc-4 dpl-l) RNA from dissected gonads, and normalized to hexokinase expression. Error bars indicate technical replicates. The dashed 
line indicates 1.5-fold difference, (c) qRT-PCR results for selected soma-specific (blue), intestine-specific (orange), and broadly bound (black) 
candidate target genes that did not show any regulation in the 20°C or 26°C LI microarray datasets. The fold difference of expression of each 
gene was compared between wild-type (N2) and Iin-35(n745) mutant Lis raised at 26°C, and normalized to actin {oct-3) expression. Error bars 
indicate technical replicates. The dashed line indicates 1.5-fold difference. 
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soma, to reduce mes-4 expression and prevent it from 
targeting germline-expressed genes inappropriately. In 
this manner, LIN-35 might prevent activation of an 
extensive germline gene expression program in the 
soma without directly binding each regulated gene. 



CSR-1 is required for the soma-to-germline 
transformation of lin-35 mutants 

The soma of lin-35 mutants also exhibits the germline 
characteristic of enhanced RNAi sensitivity [9]. 
One hypothesis is that various proteins involved in 
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RNAi-based pathways in the germline are mis- 
expressed in the soma of lin-SS mutants [9,30]. Exami- 
nation of the binding profiles of RNAi pathway genes 
showed that several are bound specifically in the germ- 
line, including genes encoding the Argonaute family 
proteins, csr-1, ppw-1, wago-1, wago-2, and wago-10, as 
well as the RNA-dependent RNA polymerase ego-1 
(Figure 5a; Figure S6B in Additional file 1). To test the 
effect of binding on their expression in the germline, 
we performed qRT-PCR on dissected gonads from 
wild-type and dpl-1 mutant animals (Figure 5b). 
Expression decreased in dpl-1 relative to wild-type, 
indicating that DPL-1, and presumably EFL-1 as well, 
contribute to germline expression of these genes. In 
the soma of lin-3S mutants, a subset of these genes, 
primarily csr-1 and ppw-1, are upregulated (Figure 5c), 
despite an absence of binding by LIN-35. We suspect 
that this regulation is stage specific, as Wu et al, [8] 
did not detect consistent regulation of csr-1 in young 
adults. The increased expression of these germline 
RNAi-related genes in the larval soma could be a con- 
sequence of elevated MES-4 activity. 

CSR-1 affects chromatin status [31] and chromosome 
segregation [32], suggesting that its upregulation in the 
soma might have a significant effect on the soma-to- 
germline transformation. We therefore tested whether 
loss of any of these small RNA regulatory genes could 
rescue the larval arrest phenotype of lin-35 mutants at 
26°C. We found that loss of csr-1, and to a lesser extent 
wago-1, rescued the high temperature arrest of lin-3S 
mutants (Figure 5d). Because CSR-1 binds to small 
RNAs (22G-RNAs) that are antisense to germline genes, 
we speculated that these small RNAs might be impor- 
tant for recruiting MES-4 to germline genes to promote 
their expression, or conversely, that MES-4 activity is 
necessary for CSR-1 to be appropriately targeted to 
germline genes. If either of these possibilities is true, 
then the genes corresponding to CSR-l-bound 22G 
RNAs should overlap significantly with the genes regu- 
lated by MES-4. We therefore compared genes targeted 
by CSR-1 22G RNAs [32] with MES-4 target genes [29], 
and found that 76% of the MES-4 targets have CSR-1- 
associated 22G RNAs (Additional file 7). Moreover, 
RNAi of mes-4 and csr-1 together does not further sup- 
press the lin-3S larval arrest phenotype compared to 
RNAi of either gene alone, suggesting that they might 
act in the same pathway (Figure 5d). Consistent with 
this possibility, wago-1 (RNAi) was less effective than csr- 
l(RNAi) at suppressing the lin-3S larval arrest, and the 
overlap between the top 100 genes targeted by WAGO- 
1 22G RNAs [33] and MES-4 targets was only 10%. We 
conclude that CSR-1 contributes to the soma-to-germ- 
line transformation of lin-3S mutants, at least in the 
intestine. 



LIN-35 and HPL-2 exhibit common specialized binding 
patterns in intestinal chromatin 

The intestine is the key tissue for mediating the high- 
temperature larval arrest phenotype of lin-35 mutants 
[10]. The distinction between the DRM and heterochro- 
matin complexes is not consistent in this tissue: for 
each complex, certain components are involved in the 
larval arrest {lin-35 and hpl-2) while others are not {efl- 
1 and lin-61) [8,10]. We found that LIN-35 and HPL-2 
exhibit a unique type of binding behavior in the intes- 
tine that could explain this discrepancy and provide a 
possible mechanism for the larval arrest phenotype. The 
subset of binding events we called 'intestine-specific' are 
bound primarily by LIN-35 and HPL-2, and sometimes 
exhibit weak binding by DPL-1 but essentially no bind- 
ing by EFL-1. Thus, these sites have minimal, if any, 
input by E2F (Figure 6a). As described previously, these 
intestine-specific binding sites exhibit several other dis- 
tinctive features, including a high proportion of sites 
that could not be assigned to specific gene targets, a 
paucity of target genes with germline expression, and 
enrichment on the X chromosome (Figure Ib-d). Addi- 
tionally, fewer genes in the intestine-specific set are 
associated with E2F consensus motifs (Figure S7 in 
Additional file 1; Additional file 8). Notably, intestine- 
specific binding sites cover almost twice as many 
nucleotides compared to other tissue-specific binding 
sites (Figure 6b), and are often found in gene bodies or 
intergenic regions instead of immediate upstream regu- 
latory regions (Figures la and 6a). 

These observations indicate that LIN-35 is likely 
recruited to these sites through a novel, intestine-speci- 
fic mechanism that includes HPL-2. Indeed, we find that 
HPL-2 binding is diminished in the absence of LIN-35 
binding in the intestine (data not shown). Thus, even 
though HPL-2 might regulate a different set of genes 
from the DRM complex in other tissues, in the intestine 
it acts only at a subset of LIN-35 binding events. Possi- 
bly, LIN-35 has multiple regulatory functions in the 
intestine, including a tissue-specific interaction with 
HPL-2 (either separately or as part of the heterochroma- 
tin complex) that is independent of EFL-1, as well as a 
more canonical, perhaps less tissue-specific, function in 
the DRM complex. 

Discussion 

The Retinoblastoma (Rb) tumor suppressor pathway is 
inactivated in tumors of diverse tissue origins at a very 
high frequency. Although intensively studied, the actual 
mechanisms by which the Rb pathway directs proliferation 
and differentiation within the tissue-specific restrictions 
imposed in vivo are poorly understood. Here, we address 
this limitation by developing a system in C. elegans to 
globally identify the tissue-specific chromatin interactions 
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Figure 5 22G small RNA pathways mediate the lin-35 larval arrest phenotype. (a) Binding profiles at loci encoding candidate small RNA 
pathway regulatory proteins, (b) qRT-PCR analysis of transcript levels of candidate small RNA pathway regulators in the dissected gonads of 
wild-type and dpl-l mutants. Error bars indicate technical replicates, (c) qRT-PCR analysis of transcript levels of candidate small RNA pathway 
regulators in LI larvae of wild-type and lin-35 mutants. Error bars indicate biological replicates, (d) Assay of the high temperature larval arrest 
phenotype of lin-35 mutants at 26°C upon RNAi of candidate small RNA regulators (listed on the x-axis) and empty vector control (L4440). 
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Figure 6 HPL-2 requires LIN-35 for recruitment to intestine-specific binding sites (a) Single gene examples of the typical binding profiles 
for intestine-specific targets, (b) Binding site width for each tissue-specific candidate target set. Error bars represent standard error. 



of the core members of this pathway, LIN-35/Rb, EFL-1/ 
E2F, and DPL-l/DP. A key advantage of this approach is 
that we were able to compare binding profiles between tis- 
sues to separate broadly bound sites from those present in 



individual tissues, thus focusing on the most biologically 
relevant binding events. This analysis revealed distinct sets 
of binding sites, with different candidate gene targets and 
modes of regulation in specific tissues. In a whole animal 
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analysis, the sheer number of broadly bound sites relative 
to tissue-specific sites would have obscured the distinct 
functions of the Rb/E2F complex in the different tissues. 
Many of the broadly bound sites correspond to gene tar- 
gets related to the best known mammalian E2F targets, 
such as cell cycle genes. Thus, our results suggest that 
many in vivo targets and much of the tissue-specific regu- 
lation by the Rb/E2F complex still remains to be discov- 
ered in mammalian systems. 

Tissue-specific relationships between LIN-35 and 
EFL-1/DPL-1 

In many or most somatic tissues, LIN-35 and EFL-1/ 
DPL-1 bind at many of the same targets. However, the 
tissue-specific binding profiles reveal that these factors 
do not always co-occupy the same binding sites, but exhi- 
bit uniquely bound sites distinct to particular cell types. 
The tissue-specific relationship between LIN-35 and 
EFL-l/DPL-1 binding correlates with effects on gene 
expression. In the germline, EFL-1 and DPL-1 frequently 
bind DNA in the absence of appreciable LIN-35 binding, 
and EFL-l/DPL-1 act independently of LIN-35 to pro- 
mote expression. In the soma, EFL-l/DPL-1 targets exhi- 
bit extensive LIN-35 binding, and their expression is 
either inhibited or apparently unaffected by LIN-35 
activity. 

One possible mechanism for how LIN-35 might be 
specifically inhibited from binding in the germline 
comes from mammalian studies that have shown that 
Rb is largely refractory to ChIP analysis in transformed 
cells (reviewed in [34]). The phosphorylation status of 
Rb apparently alters its association with chromatin: 
phosphorylated Rb shows poor binding, while a phos- 
phorylation-defective mutant has increased binding [35]. 
Therefore, one possibility is that post-translational regu- 
lation of LIN-35/Rb, perhaps by phosphorylation, limits 
its association with chromatin in a germline-specific 
manner. Because transformed cells and germline cells 
both represent undifferentiated cell types, the inability 
of LIN-35/Rb to effectively bind chromatin could be a 
general property of progenitor cells in vivo. 

By contrast to the germline, many intestine-specific 
binding sites exhibited strong LIN-35 binding in the 
absence of substantial binding by EFL-1. Although two 
other E2F-like proteins are encoded in the C. elegans gen- 
ome, neither appears functionally redundant with EFL-1. 
EFL-2 primarily has a role in regulating apoptosis [36], 
while F49E12.6 exhibits relatively poor binding by ChlP- 
seq and has little overlap with EFL-1 (data not shown). 
Moreover, very few genes in the intestine-specific set have 
an upstream consensus E2F sequence (unlike the other 
datasets), and the broad LIN-35 peaks are not restricted to 
promoter regions. All of these observations are consistent 
with the idea that LIN-35 can be recruited to multiple 



sites in the genome through a mechanism that does not 
depend on binding by an E2F-like protein. 

The intestine-specific peaks are very broad and have a 
lower correlation with annotated genes compared to the 
narrow peaks typically produced by sequence-specific 
transcription factor binding. Moreover, the intestine- 
specific set is not enriched for germline-expressed 
genes, and even exhibits a preference for sites on the X 
chromosome, which is opposite to the trends for the 
other categories of binding sites. These intestine-specific 
sites occur primarily in the intestine, as they are much 
reduced when the entire soma is assayed. These specia- 
lized sites could be the means by which LIN-35 med- 
iates certain intestine-specific functions, such as 
influencing endoreplication of intestinal nuclei [16] and 
guarding against a high temperature larval arrest [10]. 
Intriguingly, the heterochromatin-associated protein 
HPL-2 co-occupies these sites with LIN-35, and hpl-2 
mutants exhibit similar endoreplication defects and a 
similar larval arrest as lin-SS mutants. These sites might 
mark some tissue-specific chromatin conformation that 
serves as a point of entry for the replication machinery 
and/or is permissive for germline gene expression. 

Key targets involved in the soma-to-germline 
transformation of l'm-35 mutants 

The identification of tissue-specific target genes sheds 
new light on the diverse mechanisms by which LIN-35, 
EFL-1 and DPL-1 influence the fate and function of dif- 
ferent cell types. In the germline, a relatively straightfor- 
ward relationship exists between target genes and the 
defects in oogenesis and early embryogenesis of efl-1 
and dpl-1 mutants: EFL-l/DPL-1 directly bind at and 
promote the expression of many genes known to act in 
oogenesis and embryogenesis. The situation in the soma 
is more complex, at least in the LI animals in which we 
analyzed binding profiles. Most somatic target genes 
directly bound by LIN-35, EFL-1, and DPL-1 have unal- 
tered transcript levels in lin-35 mutants either by micro- 
array or qRT-PCR analysis. The SynMuv A pathway is 
functionally redundant with the SynMuv B pathway, and 
its activity might compensate for certain phenotypes of 
lin-35 mutants, stabilizing expression of a subset of 
direct targets. 

Strikingly, mutation of lin-35 results in substantial up- 
regulation of many genes that do not have LIN-35 bind- 
ing nearby. How expression of these indirect targets is 
affected is unknown, but one direct target gene, mes-4, 
might link LIN-35 DNA binding with indirect effects on 
regulation of a subset of these genes, mes-4 encodes an 
H3K36 histone methyltransferase that preferentially acts 
on germline-expressed genes to promote their expression 
in the germline of the next generation [29], and its activ- 
ity is essential for the soma-to-germline transformation 
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of lin-SS mutants. MES-4 activity and LIN-35 activity 
could be in separate pathways that converge to oppositely 
regulate common gene targets, but our data suggest that 
their relationship is linear, at least in some tissues. LIN- 
35/EFL-l/DPL-l binds to the promoter of mes-4, and 
limits its expression in at least a subset of somatic tissues. 
In lin-35 mutants, EFL-1 no longer binds the mes-4 
locus, whereas most other EFL-1 binding sites persist, 
indicating that mutation of lin-3S disrupts the DRM 
complex more extensively at mes-4 than other direct tar- 
get genes. Ectopically expressed MES-4 then inappropri- 
ately promotes somatic expression of germline-expressed 
genes. Ultimately, these upregulated genes in the soma 
are likely to mediate much of the conversion from 
somatic to germline characteristics. 

Indeed, several indirect somatic targets were of parti- 
cular interest in mediating this phenotype, such as those 
acting in germline-specific small RNA pathways. Strik- 
ingly, these genes are direct targets of EFL-l/DPL-1 in 
the germline but not in somatic tissues of wild-type ani- 
mals. We wondered whether the EFL-l/DPL-1 binding 
sites utilized in the germline could become accessible to 
EFL-l/DPL-1 in the soma in the absence of LIN-35 
activity, but found that EFL-1 is not recruited to these 
loci in lin-3S mutants (Figure S6C in Additional file 1), 
suggesting some other mechanism for their regulation. 
Potentially, MES-4 promotes their expression in lin-35 
mutants instead. 

Given the enhanced RNAi sensitivity of lin-35 
mutants, we tested several of these small RNA pathway 
genes for a key role in the soma-to-germline transfor- 
mation, and found that reduction of wago-1 or csr-1 
activity suppressed the lin-35 larval arrest phenotype. 
WAGO-1 and CSR-1 bind to distinct pools of small 
22G RNAs that target different classes of genes [32,33]. 
In particular, CSR-1 -associated 22G RNAs match genes 
expressed in the germline. However, whether these 22G 
RNAs contribute to ectopic germline gene expression in 
the soma of lin-35 mutants still requires exploration. 
Intriguingly, reduction of both csr-1 and mes-4 activity 
simultaneously does not lead to greater suppression of 
the lin-35 larval arrest phenotype than either alone, and 
MES-4 and CSR-l-associated 22Gs appear to target lar- 
gely overlapping sets of germline-expressed genes. Possi- 
bly, CSR-1 and MES-4 might cooperate to mark and 
promote the expression of germline genes in lin-35 
mutants. 

Thus, the tissue-specific binding profiles led to the 
implication of specific components of small RNA path- 
ways as essential mediators of the soma-to-germline trans- 
formation of lin-35 mutants. Precedence exists for one or 
a few indirect target genes playing a key role in a promi- 
nent lin-35 mutant phenotype: de-regulation of a single 



target gene, lin-3, is sufficient to induce the multivulva 
phenotype of SynMuv mutants [37]. Strikingly, like csr-1, 
lin-3 also appears to be an indirect target of LIN-35 in the 
soma, at least in LI animals (data not shown). 

Conclusions 

We present the first in depth examination of tissue-spe- 
cific binding by the Rb/E2F regulatory pathway in vivo. 
These data highlight unique and sometimes unexpected 
properties of this pathway in different tissues, and 
clearly demonstrate that Rb/E2F have specialized roles 
in both progenitor and differentiated cell types. Future 
studies should be directed toward investigating many of 
the individual gene regulatory events that could play key 
roles in mediating the tissue-specific phenotypes of this 
intriguing master regulator. 

Materials and methods 

Strain maintenance 

Nematode strain maintenance was as described [38]. 
C. elegans strain N2 was used as the wild-type strain, in 
addition to the following variants: LGI, lin-35(n745); 
LGII, unc-4(el20X dpl-l(n3316) unc-4(el20)/mnCl dpy- 
10(el28) unc-52(e444)) (MT9940); LG III, unc-119(ed3). 
All experiments were conducted at 20°C unless other- 
wise indicated. 

Transgene construction and analysis 

Tissue-specific transgenes were constructed using the Mul- 
tisite Gateway Cloning system (Invitrogen, Carlsbad, CA, 
USA). The upstream regulatory sequences from lin-35, efl- 
1, dpl-1, and ges-1 were cloned into pDONRP4PlR, and the 
pie-1 regulatory sequence in this vector (pCG142) was pur- 
chased (Addgene, Cambridge, MA, USA). The genomic 
sequences of dpl-1, lin-35, efl-1 and hpl-2 were amplified 
from N2 genomic DNA and cloned into pDONR201. The 
GFP:FLAG sequence was amplified from LIN-28::GFP: 
FLAG (a gift from Giovanni Stefani) and then PCR-stitched 
to the endogenous 3' UTR of each gene and cloned into 
pDONRP2RP3. All primer sequences are available upon 
request. Each entry clone was verified by sequencing before 
recombination into the destination vector, pCGlSO 
(Addgene) using LR Clonase II Plus (Invitrogen). The 
resulting constructs, which contain an unc-119 rescue frag- 
ment, were then transformed into unc-119(ed3) worms 
using microparticle bombardment [39]. At least one inde- 
pendent, low copy number, integrated line was generated 
for each fusion construct. GFP expression of each construct 
was visualized using a Zeiss Axioplan with DIC and 488 
wavelength for GFP. Images were collected using a Zeiss 
AxioCam MRm camera and processed using Axiovision 
software (Zeiss, Oberkochen, Germany). Supplemental 
Table 1 in Additional file 1 lists all strains used in this study. 
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ChlP-seq 

Chip assays were conducted as previously described 
[12,14]. Worms were staged by bleaching and LI starva- 
tion. The starved LI larvae were placed on OP50 bac- 
teria for 6 hours for LI collection at 20°C, or for 

4 hours for LI collection at 26°C. Young adult collection 
was performed after 62 hours at 20°C. Samples were 
crosslinked with 2% formaldehyde for 30 minutes at 
room temperature and then quenched using 1 M Tris 
pH 7.5. The pelleted worms were then quick frozen in 
liquid nitrogen and stored at -80°C. Samples were soni- 
cated to obtain 200 to 800 bp DNA fragments. For each 
sample, 2.2 mg of cell extract was immunoprecipitated 
using 7.5 [ig of goat anti-GFP (gift from Tony Hyman), 
anti-IgG (R&D Systems, Minneapolis, MN, USA), or 

5 \ig anti-EFL-1 (Novus Biologicals, Littleton, CO, USA) 
antibodies. The enriched DNA fragments and input con- 
trol (genomic DNA from same sample) were used for 
library preparation as previously described [12] in order 
to perform deep sequencing on the lUumina GA2 plat- 
form. A multiplex adaptor system was used to enable 
sequencing of four samples in each flow cell as pre- 
viously described [40]. Table S2 in Additional file 1 con- 
tains the number of reads for each sample and replicate 
used in the analyses. 

The raw data were processed as previously described 
[14]. Correlation analysis, peak calling and gene target 
assignment were also as previously described [14,19]. 
Briefly, for correlation analysis, we pooled raw signals 
from two biological replicates, normalized against input 
and used PeakSeq [19] to find peak regions of each fac- 
tor from the pooled reads as well as for each replicate. 
Correlation between two biological replicates was deter- 
mined by binning the binding peaks called by PeakSeq 
for each replicate from pooled reads (q-value cutoff of 
0.001) into non-overlapped 100-nucleotide windows to 
avoid variation from peaks of different widths [19]. Raw 
reads at each window were counted from both replicates 
and used to calculate the Pearson correlation coefficient 
between replicates. Note that the number of binding sites 
ascribed to each tissue-specific dataset is not equivalent 
to the number of gene targets for that set. Some binding 
sites are not assigned to any target gene, and other bind- 
ing sites are assigned to more than one candidate target. 
For instance, 415 germline-specific binding sites were 
assigned to 379 target genes. All ChlP-seq data have 
been deposited in Gene Expression Omnibus (GEO), 
under accession number GSE30246. 

Bioinformatic analysis of binding sites 

To determine the functional categories of genes asso- 
ciated with each set of tissue-specific binding sites, we 
used DAVID [41] to assign GO terms to the genes. The 
'molecular function' category from each tissue-specific 



dataset was sorted by significance and fold-enrichment as 
determined by DAVID, and redundant or overlapping 
categories were manually removed. The top ten cate- 
gories with enrichment greater than two-fold and a P < 
0.05 (modified Fisher exact test) were then graphed as in 
Figure 3e. The raw output for this analysis is provided in 
Additional file 5. 

We compared genes known to be regulated in the germ- 
line based on [21] with each tissue-specific dataset. For 
this comparison, only the 'intrinsic' and 'oogenesis- 
enriched' genes, totaling 2,218 genes (approximately 10% 
of the total genes in the genome), were considered germ- 
line-expressed. Pearson's chi square analysis was per- 
formed to determine the significance of over- 
representation. To determine if bound genes were signifi- 
cantly over-represented among differentially regulated 
genes in previous microarray experiments, we collected 
lists of differentially regulated genes from [10,11,20] based 
on the criteria of each study. The tissue-specific gene tar- 
gets were then compared with each list of differentially 
regulated genes for overlap. A hypergeometric probability 
test was utilized to determine the significance of the over- 
lap [42]. 

Gene expression analyses 

qRT-PCR analysis was carried out as follows. Wild-type 
and lin-3S(n74S) Ll-staged animals were grown at 20°C to 
the gravid adult stage and then bleached to isolate 
embryos. Embryos were cultured in S-basal at 26°C until 
the following day when they were placed on OP50 plates 
for 4 hours at 26°C until they were collected for RNA iso- 
lation. unc-4 control and dpl-1 mutant starved LI animals 
were grown at 20°C for 72 hours on OP50 plates before 
dissected gonads were harvested. Adult worms were 
placed in dissection buffer (M9 with 0.1% levamisole and 
0.001% Tween20) on a coverslip. We used 30 1/2 gauge 
needles to extrude approximately 112 gonad arms from 
each genotype, excising each just proximal to the sper- 
matheca. Dissected gonads were carefully transferred to an 
eppendorf containing Trizol (Invitrogen). Total RNA from 
each sample (LI animals and dissected gonads) was iso- 
lated using Trizol and then DNase treated with DNA-free 
(Ambion, Austin, TX, USA). Total RNA (250 ng) from 
each genotype was reverse transcribed using the Omnis- 
cript RT kit (Qiagen, Valencia, CA, USA). Gene-specific 
PCR was performed in duplicate for both RT and no RT 
conditions using the same protocol as ChlP-qPCR, or 
using Brilliant II SYBR Green QPCR Low Rox on the Stra- 
tagene Mx3000P system (Agilent Technologies, Santa 
Clara, CA, USA) with three-step cycling with an annealing 
temperature of 55°C followed by a dissociation program. 
Cycle threshold (Ct) values were normalized using primers 
specific to the housekeeping hexokinase gene, H25P06.1, 
or acts. 
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In situ analysis was carried out according to [43] with 
modifications as described in [44]. Probes were prepared 
from partial cDNAs cloned in the pCR2.1 vector (Invi- 
trogen). cDNA fragments used for rme-2 were nucleo- 
tides 1 to 1,029 and nucleotide 2,006 to 3,074 for par-3 
where numbering begins at the ATG in the predicted 
spliced cDNA. Gonads were stained with BCIP/NBT 
tablets (Roche, Mannheim, Germany) for 3 hours for 
rme-2 and 5 hours for par-3, mounted, and viewed 
using a Zeiss Axioplan 2 imaging epifluorescence 
microscope. 

The published microarray data used in this manuscript 
for comparison to the ChlP-seq data are in the GEO with 
accession numbers GSE26823-GSE26825 [10], GSE5071 
[11], GSE6547 [20] and GSE715-GSE737 [21]. 

RNAi and high-temperature arrest assay 

RNAi by feeding was performed using clones from the 
Ahringer library [45]. Overnight cultures were grown at 
37°C in Luria broth containing 50 (ig/ml ampicillin and 
12.5 (ig/ml tetracycline and then a 5 hour culture was 
grown in Luria broth with 50 (ig/ml ampicillin before 
plating on NGM plates containing 1 mM isopropyl P-D- 
1-thiogalactopyranoside (IPTG) and 50 (ig/ml ampicillin. 
lin-3S(n74S) L4 animals were placed on feeding bacteria 
for at least 18 hours at 26°C and then transferred to 
fresh RNAi plates and allowed to lay embryos for 5 to 6 
hours. Progeny were scored for larval arrest 3 to 4 days 
later. In order to bypass the embryonic lethality of ego-1 
(RNAi), cultures were 1:1 diluted with L4440 empty 
vector. 

Additional material 



Additional file 4: Supplementary file 3. Targets overlapping with LIN- 
54 ChlP-chip study [15]. 

Additional file 5: Supplementary file 4. GO categories for each tissue 
specific target gene set returned by DAVID, and summary page showing 
selected categories for Figure 3e. 

Additional file 6: Supplementary file 5. Genes regulated by microarray 
analyses with overlaps to tissue-specific binding sites, along with 
statistical analysis. 

Additional file 7: Supplementary file 6. Genes targeted by 22G RNAs 
[32] compared to MES-4 target genes [29]. 

Additional file 8: Supplementary file 7. Precise sequences associated 
with each motif identified by MEME for each set of tissue-specific 
binding sites. 
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